Cultivated peanut (Arachis hypogaea L.) is an important grain legume providing high‐quality cooking oil, rich proteins and other nutrients. Shelling percentage (SP) is the 2nd most important agronomic trait after pod yield and this trait significantly affects the economic value of peanut in the market. Deployment of diagnostic markers through genomics‐assisted breeding (GAB) can accelerate the process of developing improved varieties with enhanced SP. In this context, we deployed the QTL‐seq approach to identify genomic regions and candidate genes controlling SP in a recombinant inbred line population (Yuanza 9102 × Xuzhou 68‐4). Four libraries (two parents and two extreme bulks) were constructed and sequenced, generating 456.89–790.32 million reads and achieving 91.85%–93.18% genome coverage and 14.04–21.37 mean read depth. Comprehensive analysis of two sets of data (Yuanza 9102/two bulks and Xuzhou 68‐4/two bulks) using the QTL‐seq pipeline resulted in discovery of two overlapped genomic regions (2.75 Mb on A09 and 1.1 Mb on B02). Nine candidate genes affected by 10 SNPs with non‐synonymous effects or in UTRs were identified in these regions for SP. Cost‐effective KASP (Kompetitive Allele‐Specific PCR) markers were developed for one SNP from A09 and three SNPs from B02 chromosome. Genotyping of the mapping population with these newly developed KASP markers confirmed the major control and stable expressions of these genomic regions across five environments. The identified candidate genomic regions and genes for SP further provide opportunity for gene cloning and deployment of diagnostic markers in molecular breeding for achieving high SP in improved varieties. 相似文献
Marker‐based prediction holds great promise for improving current plant and animal breeding efficiencies. However, the predictabilities of complex traits are always severely affected by negative factors, including distant relatedness, environmental discrepancies, unknown population structures, and indeterminate numbers of predictive variables. In this study, we utilised two independent F1 hybrid populations in the years 2012 and 2015 to predict rice thousand grain weight (TGW) using parental untargeted metabolite profiles with a partial least squares regression method. A stable predictive model for TGW was built based on hybrids from the population in 2012 (r = 0.75) but failed to properly predict TGW for hybrids from the population in 2015 (r = 0.27). After integrating hybrids from both populations into the training set, the TGW of hybrids could be predicted but was largely dependent on population structures. Then, core hybrids from each population were determined by principal component analysis and the TGW of hybrids in both environments were successfully predicted (r > 0.60). Moreover, adjusting the population structures and numbers of predictive analytes increased TGW predictability for hybrids in 2015 (r = 0.72). Our study demonstrates that the TGW of F1 hybrids across environments can be accurately predicted based on parental untargeted metabolite profiles with a core hybridisation strategy in rice. Metabolic biomarkers identified from early developmental stage tissues, which are grown under experimental conditions, may represent a workable approach towards the robust prediction of major agronomic traits for climate‐adaptive varieties. 相似文献
Objective: Lymph node metastasis leads to high mortality rates of oral squamous cell carcinoma (OSCC). However, it is still controversial to define clinically negative neck (cN0) and positive neck (cN1-3).
Methods: We retrieved candidate biomarkers identified by proteomic analysis in OSCC from published works of literature. In training stage, immunohistochemistry (IHC) analysis was used to determine the expression of proteins and logistic regression models with stepwise variable selection were used to identify potential factors that might affect lymph node metastasis and life status. Furthermore, the prediction model was validated in validating stage.
Results: We screened eight highly expressed proteins related to lymph node metastasis in OSCC and found that the expression levels of SOD2, BST2, CAD, ITGB6, and PRDX4 were significantly elevated in patients with lymph node metastasis compared to the patients without lymph node metastasis. Furthermore, in training and validating stages, the prediction model base on the combination of CAD, SOD2 expression levels, and histopathologic grade was developed and validated in patients with OSCC.
Conclusions: Our findings showed that the developed model well predicts the lymph node metastasis and life status in patients with OSCC, independent of TNM stage. 相似文献
MiRNAs are a class of small non‐coding RNAs that are involved in the development and progression of various complex diseases. Great efforts have been made to discover potential associations between miRNAs and diseases recently. As experimental methods are in general expensive and time‐consuming, a large number of computational models have been developed to effectively predict reliable disease‐related miRNAs. However, the inherent noise and incompleteness in the existing biological datasets have inevitably limited the prediction accuracy of current computational models. To solve this issue, in this paper, we propose a novel method for miRNA‐disease association prediction based on matrix completion and label propagation. Specifically, our method first reconstructs a new miRNA/disease similarity matrix by matrix completion algorithm based on known experimentally verified miRNA‐disease associations and then utilizes the label propagation algorithm to reliably predict disease‐related miRNAs. As a result, MCLPMDA achieved comparable performance under different evaluation metrics and was capable of discovering greater number of true miRNA‐disease associations. Moreover, case study conducted on Breast Neoplasms further confirmed the prediction reliability of the proposed method. Taken together, the experimental results clearly demonstrated that MCLPMDA can serve as an effective and reliable tool for miRNA‐disease association prediction. 相似文献
Aims
The accurate estimation of aboveground biomass in vegetation is critical for global carbon accounting. Regression models provide an easy estimation of aboveground biomass at large spatial and temporal scales. Yet, only few prediction models are available for aboveground biomass in rangelands, as compared with forests. In addition to the development of prediction models, we tested whether such prediction models vary with plant growth forms and life spans, and with the inclusion of site and/or quadrat-specific factors. 相似文献
Estimating the evolutionary potential of quantitative traits and reliably predicting responses to selection in wild populations are important challenges in evolutionary biology. The genomic revolution has opened up opportunities for measuring relatedness among individuals with precision, enabling pedigree‐free estimation of trait heritabilities in wild populations. However, until now, most quantitative genetic studies based on a genomic relatedness matrix (GRM) have focused on long‐term monitored populations for which traditional pedigrees were also available, and have often had access to knowledge of genome sequence and variability. Here, we investigated the potential of RAD‐sequencing for estimating heritability in a free‐ranging roe deer (Capreolous capreolus) population for which no prior genomic resources were available. We propose a step‐by‐step analytical framework to optimize the quality and quantity of the genomic data and explore the impact of the single nucleotide polymorphism (SNP) calling and filtering processes on the GRM structure and GRM‐based heritability estimates. As expected, our results show that sequence coverage strongly affects the number of recovered loci, the genotyping error rate and the amount of missing data. Ultimately, this had little effect on heritability estimates and their standard errors, provided that the GRM was built from a minimum number of loci (above 7,000). Genomic relatedness matrix‐based heritability estimates thus appear robust to a moderate level of genotyping errors in the SNP data set. We also showed that quality filters, such as the removal of low‐frequency variants, affect the relatedness structure of the GRM, generating lower h2 estimates. Our work illustrates the huge potential of RAD‐sequencing for estimating GRM‐based heritability in virtually any natural population. 相似文献